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ERGODIC OPTIMIZATION OF PREVALENT 
SUPER-CONTINUOUS FUNCTIONS 

JAIRO BOCHI AND YIWEI ZHANG 


Abstract. Given a dynamical system, we say that a performance function has 
property P if its time averages along orbits are maximized at a periodic orbit. 
It is conjectured by several authors that for sufficiently hyperbolic dynamical 
systems, property P should be typical among sufficiently regular performance 
functions. In this paper we address this problem using a probabilistic notion 
of typicality that is suitable to infinite dimension: the concept of prevalence 
as introduced by Hunt, Sauer, and Yorke. For the one-sided shift on two 
symbols, we prove that property P is prevalent in spaces of functions with a 
strong modulus of regularity. Our proof uses Haar wavelets to approximate 
the ergodic optimization problem by a finite-dimensional one, which can be 
conveniently restated as a maximum cycle mean problem on a de Bruijin graph. 


1. Introduction 

1.1. Are optimal orbits typically periodic? Consider a continuous transfor¬ 
mation T: A ^ A on a compact metrizable space, and a continuous “performance” 
function /: A —^ IR. The main questions of ergodic optimization are: Along which 
orbits of T are the time averages of / largest? What is the statistics of those 
orbits? Equivalently, for which invariant probability measures p is average perfor¬ 
mance J f dp maximal? What properties do these so-called maximizing measures 
have? What happens as / changes? The subject was nicely surveyed by Jenkinson 
[Je3]. Some maximizing measures of special interest in thermodynamic formalism 
are the so-called ground states: see [BLL]. 

If the dynamical system T has few invariant measures then the questions above 
become uninteresting. So it is natural to focus the study on chaotic systems, as 
for example uniformly hyperbolic or uniformly expanding ones. These systems 
possess many invariant probability measures, the simplest of which are the periodic 
measures (i.e. those supported on periodic orbits). Let us say that the performance 
function satisfies property P with respect the dynamical system T if it has a periodic 
maximizing measure. 

Based on experimental results and heuristic arguments, Hunt and Ott [HO] 
conjectured that for typical chaotic systems and typical parameterized families of 
smooth functions, property P holds for most values of the parameter. Here “most” 
means full Lebesgue measure in the parameter space (but the meanings of “typical” 
are not specihed). One of the examples studied by these authors consisted on the 
doubling map T{6) = 26 on the circle U/2ttZ together with the family of functions 
formed by linear combinations of cosines and sines. For this specific case, the 
conjecture was later confirmed by Bousch [Bolj. (See also [Jel].) 
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A different conjecture by Yuan and Hunt [YH] is that for every uniformly hy¬ 
perbolic or uniformly expanding map, property P holds for topologically generic 
performance functions, i.e., it holds for every function in a dense Gs subset. As 
these authors observe, the interior of property P is dense in itself, so if the con¬ 
jecture is true then property P actually holds for an open and dense subset of the 
space of functions. 

Regularity assumptions are important. Assume that T is a “hyperbolic” (e.g., 
expanding) map. Then maximizing measures of topologically generic continuous 
function have full support (see [Bo2, .Tc3, ]Mo2]); in particular the Yuan-Hunt con¬ 
jecture fails dramatically in the space of continuous functions. On the other hand, 
every sufficiently regular function (e.g., a Holder one) has the following agreeable 
property: there exists a compact invariant set K such that an invariant probability 
measure /x is maximizing if and only if fi{K) = 1. This fact and related ones (espe¬ 
cially the so-called Mane lemma) were discovered independently by many authors 
in various settings: see [Bo2, Bo3] and references therein. 

Much research was done about property P and related ones on spaces of suffi¬ 
ciently regular functions: [YH, CLT, Bo2, BQ, Mol, QS]. We highlight the recent 
work of Contreras [Col], who showed that this property is open and dense among 
Lipschitz functions when the dynamics is expanding. 

Let us also mention that many of the problems and results we have alluded to 
were originally formulated in the context of classical mechanics, where optimization 
is naturally related with the principle of least action: see [Cl] and references therein. 
There is a celebrated conjecture by Mane [Ma] on the genericity of property P in 
those contexts, which was proved very recently by Contreras [Co2]. 

In this paper we further investigate the genericity of property P in abstract 
ergodic optimization, not in the topological sense but in a probabilistic one, thus in 
a spirit more akin to [HO]. The first step is to define “probabilistic genericity” in 
a space of functions. Though a possible approach for this is suggested in [HO], we 
follow another route which seems more appropriate. 

Probabilistic genericity should have something to do with having full measure. 
Unfortunately, topological vector spaces do not have any natural reference mea¬ 
sures, except when they are finite-dimensional. Worse, there is no natural equiva¬ 
lence class of measures. On the positive side, what does exist (under minor tech¬ 
nical assumptions) is a natural class of “fat” subsets, which corresponds in the 
finite-dimensional case to the class of subsets of full Lebesgue measure. This is 
the class of prevalent sets discovered by Hunt, Sauer, and Yorke [HSY], though 
it was previously introduced in a slightly less general setting by Christensen [Ch]. 
The surveys [OY, HK] contain much information on the subject, including gener¬ 
alizations, applications, and open problems in many areas - especially Dynamical 
Systems. 

In general, prevalence and topological genericity are neither one stronger than the 
other; indeed the references above contain examples of many properties of natural 
interest that are generic in one sense but not in the other. However, for properties 
known to be open (or to contain an open dense subset), topological genericity means 
only denseness, which is always weaker than prevalence. 

Thus we are inevitably led to the the following question: Does a prevalent ver¬ 
sion of the theorem of Contreras [Col] hold? In other words, if the dynamics is 
expanding, is property P prevalent among Lispchitz functions? 
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Though being currently unable to solve the problem in full generality, we do 
answer the question positively under further assumptions. First, we deal with 
the simplest interesting dynamics in this context: the full one-sided shift on two 
symbols. Second, we work on spaces of super-continuous functions, i.e. functions 
that have a very strong modulus of regularity. The setting comes from the work of 
Quas and Siefken [QS], who proved denseness of property P in some of these spaces. 

Let us proceed with the exact definitions and statements. 

1.2. Precise statements. Let F be a completely metrizable vector space. Fol¬ 
lowing [HSY], we say that a Borel set S' C F is shy if there exists a compactly 
supported Borel probability measure fi such that 

^(S -I- u) = 0 for every w € F. 

In that case we say that p, is transverse to S} 

It is easy to see that if a set S is shy then it has measures transverse to it 
with support of arbitrarily small diameter. Therefore, given any element of F it is 
possible to add to it a small random perturbation so that it falls outside of S with 
probability 1. 

A set whose complement is contained in a shy set is called a prevalent set. Preva¬ 
lence has the following properties (see [HSY]): it is preserved under translation, 
augmentation, and countable intersections, it implies denseness, and it coincides 
with the notion of having full Lebesgue measure when the ambient space is finite¬ 
dimensional. 

Next, let us state more formally the setting for ergodic optimization. Let (X, d) 
be a compact metric space, and T: X —>■ X be a continuous map. Let Mt denote 
the set of T-invariant Borel probability measures. If /r G AIt is supported on a 
periodic orbit then it is called a periodic measure. Let C°(X) denote the space of 
real-valued continuous functions on X. The ergodic supremum of / G C^{X) is 
defined as 

(1.1) ergsup(/) := sup(/,/i), 

flGAiT 

where angle brackets denote integration. A measure fj, G Mt where this supremum 
is attained is called a maximizing measure for /. By continuity and compactness, 
maximizing measures always exist. The ergodic supremum can also be expressed 
in terms of maximal time averages of /: see [Jc3]. If one of these is periodic then 
we say that / has property P. 

Let C'^'P(X) C C°(X) be the set of Lipschitz functions, endowed with the Lip- 
schitz norm ll-jlLip := IMIoo + Lip('), i.e. the sum of the supremum norm and the 
least Lipschitz constant. Then C'^'p(X) is a Banach space, and it is nonseparable 
unless X is countable, because it contains the discrete subset {d{x, ■) ; x G X}. 

We say that a function / G C'^'p(X) has the locking^ property if it has a unique 
maximizing measure p that is periodic (in particular, / has property P) and more¬ 
over pL is the unique maximizing measure for every function sufficiently close to / in 
the Lipschitz norm. We can also say that / and all functions close to it are locked 
to the measure p.. Of course, the locking property depends on the metric d. 

^If the space V is separable then the compactness hypothesis is basically avoidable. However 
we will deal with non-separable spaces. 

^This is a free translation of the verrouillage term used in [Bo2, §8]. 
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The set of functions with the locking property is exactly the interior of the set 
of functions with property P: one inclusion is obvious from the definition, and the 
other was shown by Yuan and Hunt [YH, Remark 4.5]^. In particular, the locking 
property is dense among the functions with property P. We will henceforth focus 
our attention on the locking property. 

We work in the symbolic space H := {0,1}^, where N includes 0. Suppose 
a = (an)neN is a strictly decreasing sequence of positive numbers converging to 
zero, or in symbols a 0. Then we define a metric da (actually an ultrametric) 
on n as follows: given two sequences x = (Xn), y = (jjn) S if they are different 
we set da{x,y) := am where m is their first disagreement (i.e., the least position 
where the symbols differ), otherwise we set da{x,y) := 0. All metrics of this form 
induce the product topology on fl. 

Let C°‘{n) := (7^'P(fl) where is endowed with the metric da', in other words 
C“(H) is the space of functions on that are Lipschitz with respect to the metric 
da- As explained above, this is a non-separable Banach space. 

Of course, the faster the sequence a decays, the more regular are the functions in 
C“(H). As an example, consider geometric sequences (0")„gN with factor 9 G (0,1); 
then / is a-H61der with respect to the metric if and only if / G 

Let us consider ergodic optimization on the function spaces with respect 

to the shift dynamics a: H —>■ fl. Let 

La C C“(fl) 

denote the set of da-Lipschitz functions with the locking property. 

It was proved by Quas and Siefken [QS] that if the sequence a = (ara)ngN is 
such that lima„+i/an, = 0 then La is dense (and thus topologically generic) in 
C“(H). Contreras [Col] obtains the same conclusion under the weaker condition 
supa„+i/a„ < 1. (Actually, Quas and Siefken deal with one-sided shifts in an 
arbitrary finite number of symbols, while Contreras deals with arbitrary expanding 
maps.) 

Our aim is to replace denseness by prevalence in this kind of results. However we 
need an even stronger modulus of regularity. Let us say that a sequence a = (a„)„gN 
is evanescent if a \ 0 and 

(1.2) ^ = 0(2-2 "+^). 

In this paper, we prove the following: 

Theorem A. If a is an evanescent sequence then La is a prevalent set in C“(n). 

Since the set La is open by definition, and every prevalent set is dense, we 
reobtain its topological genericity under the strong assumption (1.2). We stress 
that our methods are very different from those of [QS, Col], however. 

Currently we are not able to weaken the evanescence condition (1.2) considerably. 
The difficulties will be discussed later. On the other hand, our methods should 
be easily extendable to shifts on an arbitrary finite number of symbols or more 
generally to subshifts of finite type. Favoring clarity at the expense of generality, 
we decided to work with shifts on 2 symbols only. This choice has the additional 

^Actually this exact statement is not proved in the paper [YH] but it follows by an adaptation 
of the arguments. The interested reader may try to figure out the details himself or check our 
note WWW. mat.uc.cl/~jairo.bochi/docs/lock.pdf . 
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advantage of being able to express the random perturbations leading to the proof 
of Theorem A in a quite convenient way. 


1.3. The random perturbations. Let us describe explicitly certain probability 
measures that are transverse to the complement of L^. Therefore adding to any 
given function in a random perturbation chosen according to one of those 

probability measures one almost surely obtains a function in L^. 

Let n* denote the set of all finite words in the letters 0, 1, including the empty 
word. Each w G determines a cylinder [w] which is the set of elements of 17 that 
have (jj as an initial subword. The level of the cylinder [tc] is defined as the length 
of the word w, which is denoted by |a;|. 

For each w G 17*, we define the Haar function: 


(1.3) 


h,., := 


X[a;0] X[a;l] 
2 


where x denotes characteristic functions and wO, wl denote concatenated words. 
These functions are a straightforward adaptation of the classical Haar functions 
on [0,1] (see e.g. [Pi, §6.3]). Working on the Cantor set 17 has an advantage over 
working on the interval [0,1]: our Haar functions (1.3) are continuous. Our unusual 
choice of the normalization makes subsequent formulas simpler. 

Let us mention a few facts about these functions, leaving the details for §2. 
Haar functions, together with the constant function 1, form an orthogonal set with 
respect to a natural inner product. An arbitrary continuous function / G (7°(17) 
can be uniquely represented as a Haar series: 


f = c(/) + > 

* 


where c(/), Ci^{f) are called the Haar coefficients of /. 

A gauge is a family b = (6cj)cjgn* of positive numbers indexed by words. Suppose 
a = (a„) is an evanescent sequence in the sense (1.2). We say that a gauge b — {b^^) 
is admissible with respect to a if 

(1.4) bn = o(a„) and log(a„/6„) = 0{n). 


where 


bn '■= max buj and b„ := min b^j 

\oj\=n \uj\=n 


In other words, b^j becomes much (but not superexponentially) smaller than an as 
n = jtLij —>■ oo. Examples of admissible gauges are b^j = and b^^ = 2“"’a„. 


Lemma 1.1. Suppose that a = {an) is an evanescent seguence, and that b = {b^) 
is an admissible gauge with respect to a. Given numbers c^j satisfying \cuj\ < buj for 
all ct) G 17*, the Haar series X)i.c;Gn* represents a unique function in C°’{fl). 

Moreover, the functions obtained in this way form a compact subset "Hb of C°‘{GL). 


The set of functions Hlb as in the lemma above is called the Hilbert brick with 
gauge b. By means of the Haar coefficients, it can be identified with the prod¬ 
uct space Owen* endow each interval [—bunb^^] with the normalized 

Lebesgue measure, take the product probability, and using the identification obtain 
a probability measure on the Banach space C“(17) supported on the compact 
subset 'Hb- 

The main technical result of this paper is the following: 
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Theorem B. Suppose that a is an evanescent sequence and that b is an admissible 
gauge with respect to a. Then for any fo G C°'{Q), 

^*6 ( {5 S Tib ;/o + <?£La})=l. 

The conclusion of the theorem is that the measure Pb is transverse of the comple¬ 
ment of the set La. In particular La is prevalent. So Theorem B implies Theorem A. 

Remark 1.2. Based on experimental evidence, Hunt and Ott [HO] also conjecture 
that for typical parameterized families of functions, the Lebesgue measure of the pa¬ 
rameters corresponding to maximizing measures with period p or greater decreases 
exponentially with p. Using our methods it is straightforward to construct spaces 
of Lipschitz functions and measures on Hilbert bricks (which can be regarded as 
families on infinitely many parameters) such that the measure of the parameters 
corresponding to maximizing measures with period p or greater decays faster than 
a given function of p. Such result does not seem especially interesting, so we will 
not provide details. 

1.4. Main ideas of the proof of Theorem B and organization of the paper. 

Due to high regularity, the functions considered in this paper are well-approximated 
by step functions. Equivalently, their Haar coefficients decay fast. The strategy of 
the proof is to show that with high probability the maximizing measure for the 
function / is the same as for a step function obtained by a truncation of the Haar 
series. 

The paper is organized as follows: In Section 2 we prove a few simple facts about 
Haar series and the compactness of Hilbert bricks (Lemma 1.1). In Section 3 we 
study in detail some finite-dimensional projections of the set of invariant measures 
(rotation sets). We describe these sets in graph-theoretical terms. This informa¬ 
tion is basic to the arguments that follow. In Section 4 we obtain a sufficient gap 
condition for the locking property; it is basically a transversality condition in terms 
of finitely many Haar coefficients. In Section 5 we use evanescence (1.2) and ad¬ 
missibility (1.4) to show that with probability 1 the transversality condition will be 
satisfied at some level, thus proving Theorem B. The final Section 6 contains a few 
open questions and discusses the obstacles to improving the evanescence condition. 

2. More about Haar series and Hilbert bricks 

In this section we collect a few facts about Haar series and we prove Lemma 1.1. 

We fix some notation. Let x = (xg, xi,...) and y = (j/Oj Vi, ■ ■ ■) be two elements 
of H = {0,1}'^. Let a: t y denote the position of first disagreement between the 
sequences x and y, that is, the least m S N such that Xm 7 ^ ym, with the convention 
x \ X = 00 . The following properties hold: 

2 ^ 12/= 2/1 2 : (symmetry), x ] z >m.vn(x \ y,y \ z) (ultrametric property). 
For each n S N, we define the n-th variation of a function /: H ^ D? as: 

( 2 . 1 ) var„(/) := sup |/(a:) -/(y)| . 

x'\y'>n 

If the function / satisfies var„(/) = 0, i.e., it is constant in each cylinder of level n, 
then it is called a step functions of level n. Such functions form a vector space Sn 
of dimension 2 ”. 
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Recall the definition (1.3) of the Haar functions han where w G fl*. For each 
n G N, the set {1} U {hi^ ; |a;| < n} forms a basis of the vector space Sn- 

Let p be the unbiased Bernoulli measure on i.e. the probability measure that 
assigns equal weights to all cylinders of the same level. Let LP{P) denote the Hilbert 
space of functions that are square-integrable with respect to the measure p. Then 
the set {1} U ; w G H*} is an orthogonal basis of L'^{P). Thus every / G L‘^{P) 
can be represented by a Haar series: 


/ = c(/)+ ^ (equality in L2(/3)), 

where the Haar coefficients are defined as: 

(2.2) cif):=Jfdp, c^(/) := 21-1+2 d/3, 

The n-th approximation of / G L'^(P) is defined in any of the following equivalent 
ways: 

• the projection of / on the subspace Sn along its orthogonal complement; 

• the sum of the truncated Haar series: 


(2.3) Anf :=c{f)+ 

\uj\<n 

• the function obtained by averaging / on cylinders of level n: 


(2.4) 


Anf= Y. ( 2 ^ 

|a;|=n \ 


/ d/3 . 


As a consequence of the last characterization we have, in terms of notation (2.1), 


11 / - A„/||oo < var„(/), 


which converges to 0 as n —>■ oo if / is continuous. Therefore every continuous 
function / can be written as a uniformly convergent series: 

/ 

OO 

f = c{f) + Y 

TL —0 

\|a;|=n 

which, with some abuse of language, we also call Haar series. 

Given a sequence of coefficients, when do they form the Haar coefficients of a 
continuous function? Note that, as an immediate consequence of definition (2.2), 


\ 

/ 


(2.5) |cc.(/)| < var|,^|(/); 

therefore a necessary condition for the continuity of a function is that its Haar 
coefficients tend to 0. The next proposition gives a sufficient condition: 


Proposition 2.1. Given a family of numbers (c;^)^gn*, denote c„ := sup|^|^„ \cuj\- 
IfECn < OO then the Haar series 
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is normally convergent and defines a continuous function f such that 

(2.6) ||/||oo<|c| + i^c„, 

n 

(2.7) var„(/)<^c/c Vn £ N. 

k^n 

Proof. Note that if ui and u)' are two different words of same length then the corre¬ 
sponding Haar functions h^, ha;' have disjoint support. Using this fact, the normal 
convergence claim and estimate (2.6) follow. Let us proceed to prove (2.7). Fix 
distinct points x, y € and estimate 

OO 

\f{x) - f{y)\<'^Ck \ha;{x) - ha;{y)\. 

k—0 |lj|— fc 

Let n := X ^ y (the position of first disagreement between x and y). Note that: 

• if fc < n then all terms in the inner sum are 0; 

• if fc = n then the inner sum contains exactly one nonzero term, namely the 
one corresponding to uj = Xq ■ Xn-i = Vo ■ ■ ■ Vn-i (which by convention is 
the empty word when n = 0), whose value is 1; 

• if fc > n then the inner sum contains exactly two nonzero terms, namely the 
ones corresponding to tv = xq • ■ • Xn-i and uj = yo ... yn-i, both of which 
have value 1/2. 

Therefore we obtain that \f{x)—f{y)\ < completing the proof of (2.7). □ 


Now let us specialize to the Banach spaces C“(U) considered on §1.2. Fix a 
sequence a — (an) \ 0. Note that the least Lipchitz constant Lip„(.) with respect 
to the metric da can be expressed in terms of the variations ( 2 . 1 ) as 


( 2 . 8 ) 


LiPa(/) = sup 

neN 


varn(/) 

Q-n 


By (2.5), the Haar coefficients of any / £ C“(U) satisfy the bounds: 


(2.9) \ca;{f)\ < a„Lip„(/) for all w £ fl* with |a;| = n. 


Conversely, if the coefficients (c^j) in Proposition 2.1 satisfy J2k>n = 0(a„) then 
they are the Haar coefficients of a function / £ C'“(H). 

Given a gauge b — {ba;)u;^n* (i-e., a family of positive numbers indexed by words) 
such that 


( 2 . 10 ) 




< CX) , 


where •= sup boj, 

\uj\—n 


the Hilbert brick with gauge b is: 

(2.11) *776 := < Y ; |ca;| < boj 

Then Plb is a well-defined set of continuous functions, as an immediate application of 
Proposition 2.1. By the observations above, if X)fc>n = 0{an) then ■K 5 C (^“(H). 
We have the following compactness criterion: 



Proposition 2.2. = o(a„) then the Hilbert brick Pit is a compact subset 

of the Banach space C°'{n). 



ERGODIC OPTIMIZATION OF PREVALENT SUPER-CONTINUOUS FUNCTIONS 


9 


Proof. The product topology on Q := Otjen* makes it a compact space. 

Using properties (2.6), (2.7), and (2.8), together with the assumption on the series 
^ bk , one checks that the map 

€ Q 1—^ y ] G (^“(fl) 

is continuous. Therefore its image TYf, is compact. □ 

Proof of Lemma 1.1. If a = {an) is an evanescent sequence then J2k>n “ 0{an). 
So if b = {buj) is an admissible gauge with respect to a then J2k>nbk = o{an). 
Applying Proposition 2.2 we conclude that 'Hf, is a compact subset of C°’{Li). □ 

As the reader will have noticed, the second admissibility condition in (1.4) is 
not needed for the validity of Lemma 1.1. It will be fundamental for the proof of 
Theorem B, however. 

3. Finite-dimensional ergodic optimization 

3.1. Rotation sets. As in §1.2, consider a general continuous transformation T 
of a compact metric space X. The set Air of invariant probabilities is a compact 
convex subset of the space A1±(A) of all finite signed Borel measures, endowed 
with the usual weak topology. 

Let F C C^{X) be a finite-dimensional space of continuous functions. Given a 
basis {/i, ...,/„} of F, consider the continuous linear map 

(3.1) tt: A(±(A)IR" defined by 7r(^) = ((/i, ^), ...,(/„, ^)) . 

A change of basis produces a conjugate map tt. The image of Mt under tt is a 
convex compact subset of R", called a generalized rotation set, or simply a rotation 
set. The terminology is not especially good, but since it seems to be established in 
the literature [Zi, Jc2, KW ] we will conform with it. The prime example is when 
(/i,..., /„) is the displacement vector of a map of the n-torus homotopic to the 
identity. An example which is much more related to the theme of this paper is the 
Bousch’s “fish” [Bol], where the dynamics is the doubling map T{9) = 26 on the 
circle R/27rZ and F is spanned by the functions cos 6* and sin0, so the projection tt 
consists on computing two Fourier coefficients. 

It is shown in [KW] that arbitrary compact convex sets in R" appear as rotation 
sets of shift transformations. On the other hand, a situation where rotation sets 
are relatively simple was uncovered by Ziemian [Zi]: if T is a transitive subshift 
of finite type and F is composed by step functions constant on cylinders of level 2 
then the associated rotation set is a polytope. 

Let us say that a rotation set R = tt{Mt) has the injectivity property if for 
each extreme point p of i? there exists an unique p G Mr such that 7r(/i) = p. 
Examples are given by Ziemian’s polytopes (where the preimages of extreme points 
are certain periodic measures) and Bousch’s fish (where the preimages are the 
Sturmian measures). 

Let us discuss the relevance of rotation sets in ergodic optimization. The map tt 
has the following obvious projection property, for all p, p. G At±(A), 

(3.2) tt{p) = Tr{p) <^=> (/, p) = (/, p) for all / € F. 
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Moreover, tt is onto IR" (since the functions /i, ..., fn are linearly independent). 
Using these facts, we can define a bilinear map {(•, •)): F x IR" ^ IR, by 

(3.3) ((/)^)) •= (/)M) where /x is such that 7r(/x) = v. 

In particular, the ergodic supremum (1.1) of a function f € F can be expressed as: 

(3.4) erg sup / = sup((/, v)). 

vGR 

Therefore R and ((•,•)) encode basically all the information necessary to do ergodic 
optimization for functions in F, which becomes a finite-dimensional problem. In 
the particular case that i? is a polytope, the computation of erg sup / becomes a 
linear programming problem. If the supremum in (3.4) is attained at a unique point, 
which is often the case, and R happens to have the injectivity property, then the 
original ergodic optimization problem (1.1) has a unique solution, i.e., the function 
f G F has a unique maximizing measure. 

3.2. A family of rotation sets for the shift. Let us focus on the shift trans¬ 
formation cr on U = {0,1}'^. For each n G N, consider the vector space of step 
functions of level n, which has dimension 2"’. The characteristic functions of cylin¬ 
ders of level n, ordered lexicographically, form a basis of Sn- Let 7r„ : Al±(n) — >■ IR^ 
be the corresponding projection map, as defined in the previous subsection, that is, 

TToip) ■■= (Ax(fl)), 

Ml^) ■= ( m ([ 0 ]), m ([ 1 ])), 

MfJ-) ■= (Ai([00])>M([01]),li([10]),M([ll])), etc. 

If /X is a probability measure then 7r„(/x) is a subset of the unit simplex We 

define a rotation set 

Rn ■— • 

Proposition 3.1. The rotation sets Rn are polytopes, and satisfy the injectivity 
property. 

The proposition (or at least its first part) can be deduced from the aforemen¬ 
tioned result of Ziemian [Zi]. It is also related to work by Fuks [Fu]. Since we will 
need finer information about the sets we will provide a self-contained proof. As 
in [Zi], we will use some elementary graph theory. 

3.3. Preliminaries on convexity and graph theory. Let us review some ter¬ 
minology of convex analysis. Let C be a closed convex subset of Euclidian space. 
A subset (7 C C is called a face of C if any segment in C whose relative interior 
intersects C is actually contained in C. The proper faces of C are those different 
from 0 and C. If a singleton {x;} is a face of C then v is called an extreme point 
of C. If (7 has finitely many extreme points then they are called vertices, and (7 is 
called a (convex) polytope. The faces of a polytope are also polytopes; among them, 
the 1-dimensional faces are called edges, and proper faces of maximal dimension are 
called facets. 

A closed convex set is called a (convex) polyhedron if it is an intersection of 
finitely many affine-half spaces. A fundamental theorem [Sc, Corollary 7.1c] states 
that bounded polyhedra are polytopes, and vice-versa. 
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Next let us fix some graph theory terminology. Let G be a directed graph, 
composed of finitely many nodes and arcs.'^ We will always assume that no two 
arcs have the same source and target nodes. More formally, G is a pair {N,A), 
where fV is a finite set and A C N x N. A directed graph G' = {N',A') is a 
subgraph of G if A^' C and A' C A. 

A walk in G is a finite sequence of the form noOini ■ • ■ ni-ia^ni where each at is 
the arc from the node Ui-i to the node The positive integer (, is called the length 
of the walk. A closed walk is an equivalence class of walks with equal ending and 
starting nodes, where n^aiUi ■ ■ • ni-iaino is equivalent to 71102112 ■ • • Ti^-iOfTioOini. 
A closed walk with no repeated nodes (and consequently, no repeated arcs) is called 
a cycle. 

A circulation on G is a function that attributes a weight to each arc of G in such 
a way that: 

• weights are nonnegative; 

• the total weight is 1; 

• the total weight entering any node is equal to the total weight leaving that 
node (Kirchhoff’s current law). 

The support of a circulation is the least subgraph where all weights are positive. 

The set of functions on the arcs of G can be identified with some One 
such function is a circulation if it satisfies certain linear equalities and inequalities; 
therefore the set of circulations is a polyhedron. Moreover, it is obviously bounded, 
so it is a polytope. It is called the circulation polytope of G. 

Proposition 3.2. Let G be a directed graph, and let C be its circulation polytope. 
Then the faces of C are the circulation polytopes of the subgraphs of G, and the 
vertices of C are the circulations supported on the cycles of G. 

As a corollary, every circulation on G is a convex combination of circulations 
supported on cycles. 

Proof. If a polytope is expressed in terms of linear equalities and inequalities, then 
its faces can be obtained by setting some of the inequalities to equalities: see [Sc, 
p. 101]. The inequalities in our case are “weights are nonnegative”. Forcing some 
of the arcs to have zero weights gives rise to circulations in a subgraph G' with the 
same set of nodes as G. Removing unconnected nodes from a graph does not affect 
the possible circulations on it. Therefore for every subgraph G' of G, its circulation 
polytope G' is a face of G, and all faces appear in this way. Note that G' ^ 0 if 
and only if G' contains a cycle. Moreover, each cycle supports a unique circulation, 
and different cycles support different circulations, so G' is 0-dimensional if and only 
if G' contains a single cycle. □ 

3.4. Graphical itineraries. For each integer n > 1, the de Bruijn graph G„ is 
the directed graph defined as follows: 

• the nodes of G„ are exactly the words of length n— 1 in the alphabet {0,1}; 

• the arcs of Gn are exactly the words of length n in the alphabet {0,1}; 

• for each word oj of length n, the source node and the target node of the arc 
w are respectively its initial and final subwords of length n — 1. 


■^The terms vertices and edges are reserved to polytopes. 
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These graphs (together with their analogues for larger alphabets) were introduced 
independently by de Bruijn [dB] and Good [Go], and are relevant to many problems 
in combinatorics and applied mathematics: see [Man] and references therein. The 
first four de Bruijn graphs are pictured in Fig. 1. 



Figure 1. 


The de Bruijn graphs Gn for 1 < n < 4. 


Any orbit of the one-sided shift ct: —>■ fl induces in the obvious way a semi¬ 
infinite walk on the de Bruijn graph Gn, called the Gn-itinerary of the orbit. If the 
orbit is periodic then we obtain a closed walk of length equal to the period. If /r 
is the measure supported on the periodic orbit, we denote the induced closed walk 
on Gn by Wn{^i)- The following sets of measures will play an important role: 

Cn := {/i G Ma periodic ; Wn{p) is a cycle in G„} , 

or equivalently 

Cn — {m G AIct periodic ; each cylinder of level n — 1 contains 
(3.5) . 

at most one point of supp p,| . 

There is an obvious bijection between Cn and the set of cycles in Gn- The initial 
terms of the sequence (C„) are: 

Cl = {0, T} c C2 = {0, T, OT} c C3 = {0, T, OT, OOT, OTT, OOTT} , 

where 01 , for example, denotes the periodic measure supported on the orbit of the 
point (0,1,0,1,...) G n. 

Remark 3.3. The nested union C„ equals the set of periodic measures in Ma- 
Following Maurer [Mau], we define the recursive complexity of a periodic measure 
pi as the least n such that pi G C„.® Recursive complexity is not monotonic with 
respect to the period: for example 000111 and 01011 have recursive complexities 4 
and 5, respectively. 

Remark 3.4. It seems to be a difficult task to count precisely the number of elements 
of Cn (or equivalently the number of cycles in G„): see [Mau] for related results. 
Let us mention a celebrated result by de Bruijn [dB]: for n > 2, there are exactly 


®Note that Maurer’s definition is slightly different since his G„ is our Gn+i- 
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22 n +1 Qy(jigg Qf maximal length 2” these are the Hamiltonian cycles 

of Gn, i.e., the cycles that visit every node.® 

3.5. The structure of the rotation sets i?„. Let us use the above material to 
describe precisely the rotation sets i?„ = 7r„(Atcr) introduced in §3.2. The following 
result is a finer version of Proposition 3.1: 

Proposition 3.5. For all n > 1, the rotation set Rn is isomorphic to the circulation 
polytope of the de Bruijn graph G„. Moreover, Rn has the injectivity property: the 
restriction o/7r„ to the set Cn is a bijection onto the vertices of Rn- 

Proof. Suppose p G Rn, that is, x = 7r„(/x) for some p G Ato-. The vector x gives a 
weight to each cylinders of level n and thus to each arc of Gn ■ We claim that this 
weight function is a circulation. Weights are obviously nonnegative and sum to 1, 
so we are left to verify Kirchhoff’s current law. Given any node w, that is, a word 
of length n — 1, we have the following disjoint unions: 

[w] = [wO] U [wl], CT“^([a;]) = [Ow] U [Iw]. 

So /r([u;]) is the total weight leaving the node w, while /r(cr“^([w])) is the total 
weight entering it. They are equal by invariance of the measure. So Kirchhoff’s 
current law is satisfied, and we have shown that x induces a circulation on Gn- 
Conversely, let us check that every circulation on is of the form above. This 
is obvious if the circulation is supported on a cycle, and since a general circulation 
is a convex combination of circulations supported on cycles (by Proposition 3.2), 
the claim follows. 

The injectivity property is clear, and the proposition is proved. □ 

Proposition 3.5 permits us to deduce other properties of rotation set Rn- For 
example, the dimension of i?„ is 2"“^.^ The face structure of can be studied 
using Proposition 3.2: an example is shown in Fig. 2. Since G„ has 2" arcs, it 
follows from Propositions 3.5 and 3.2 that has no more than 2^ faces. 


0 



0 or 1 0 




Figure 2. The polytope R 3 has 6 facets: 4 tetrahedra and 2 pyramids 
(pictured). The total number of faces is 1-1-6 + 13-1-13 + 6+1 = 40. 

We close this section mentioning other facts that, despite not being needed for 
the rest of the paper, have independent interest. 

^Incidentally, these cycles are exactly the lifts of the Eulerian closed walks on Gn—i- 

^Sketch of proof: Consider circulations of Gn as vectors in . Then R„ = A n K, where A 
is the standard unit simplex in R^ , and K is the vector subspace corresponding to Kirchhoff’s 
current law. One checks that K has codimension 2"“^ — 1 and is transverse to A. It follows that 
dimitn = 2"-L 
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Remark 3.6. In view of the characterization (3.4), the computation of the ergodic 
supremum of a function in Sn becomes equivalent to a maximum cycle mean problem 
for the de Bruijn graph Gn- These problems have been studied in the combinatorics 
optimization literature, and efficient algorithms are known: see [Ka, DIG].® Graphs 
with random weights on the arcs had also been studied: see [MWj. 

Remark 3.7. The Poulsen simplex AIo- can be recovered from the sequence of poly¬ 
topes (Rn) as an inverse limit. 

Remark 3.8. Using the appropriate graphs, it is straightforward to adapt the results 
above to general subshifts of finite type and in particular reobtain the results of [Zi] . 

4. A GAP CRITERION FOR THE LOCKING PROPERTY 

4.1. Statement of the criterion. Consider a step function g G Sn of level n>l. 
By the facts discussed in the previous section, in terms of notation (3.3) we have 
erg sup g = {{g,v)) for some vertex v of the polytope i?„. Define the n-gap of g as 
(see Fig. 3): 

gap„(g) := {{g, v)) - max {{g, w)). 

w vertex oi Hn 
w^v 

Note that in the expression above it is sufficient to consider the vertices w such 
that [n,?!;] is an edge of the polytope i?„. 

Let /i be the measure in such that v = 7 r„(/i). Then erg sup g = ((;, /i), i.e. g 
is a maximizing measure for g] moreover, g G Cn, and in particular it is a periodic 
measure. The n-gap can be written as: 

gap„( 5 ) = {g, g) - max ( 5 , v). 

Note that ga.p^(g) is positive if g has no other maximizing measure in Alcr, and is 
zero otherwise. In the former case, every g G Sn sufficiently close to g also has an 
unique maximizing measure, which is g. 



Mevel sets of ((<?,■)) 


Figure 3. Gap of a function g G Sn. 

Recall that Anf denotes the truncated Haar series (2.3) of a function /, and so 
it belongs to Sn. 

The following lemma plays a pivotal role in this paper: 


®It is interesting to note that Karp’s algorithm relies on a graph-theoretical version of Maiie 
lemma. In this regard, a function / on the set of arcs is called a coboundary if there exists a 
function h on its set of nodes such that f(a) = h{n\) — h(no) whenever a is an arc from node no 
to node ni. 
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Lemma 4.1 (Gap criterion). Let f G C°{Ll) andn G l\l. Assume that the following 
gap condition is satisfied: 

OO 

(4.1) gap„(A„/) > V(A: -n+1) max |c<^(/)| . 

Then the maximizing measure for Anf (which is unique and periodic) is also the 
unique maximizing measure for f. 

In informal terms, if the tail of the Haar series is small compared to the gap of 
its initial part, then it does not influence the maximizing measure; in particular 
property P is satisfied. The lemma actually permits us to obtain the stronger 
locking property: 

Corollary 4.2 (Gap criterion version 2). Consider a sequence a = (a„) \ 0 such 
that: 

OO 

(4-2) E na„ < OO. 

n—0 

If a function f G satisfies the gap condition (4.1) for some n G N then 

/e U. 

Proof. In view of estimate (2.9), assumption (4.2) implies that, for every n G N, 
the RHS of (4.1) converges for any function / G C°‘{Lt), and moreover it depends 
continuously on /. The LHS is also continuous, so the functions / in the space 
C“(ll) that satisfy the gap condition (4.1) form an open subset 'Z4i. By Lemma 4.1, 
each / G 'Un has a unique maximizing measure, which coincides with the unique 
maximizing measure for Anf and so is periodic. As observed before, maximizing 
measures of functions in Sn are locally constant whenever the gap is positive. It 
follows that 'Un C La, as we wanted to show. □ 

The proof of Lemma 4.1 will occupy the rest of this section, and requires a few 
other lemmas. 

4.2. Basins of repellency. Given p, G Ma and fc G N, let C 11 be the union 
of all cylinders of level k that intersect the support of p. 

Lemma 4.3. If p G Cn then for all s gN we have: 

S 

fJ ^ — Pfi,n+s • 

j=0 

Proof. If a; G B^^n+s then the cylinder of level n + s that contains x intersects 
supp p at some point z. So the n + s initial symbols of x and z coincide. It follows 
that for each G {0,..., s}, the cylinder of level n that contains {x) also contains 
showing that x G <7~^[B^j^^n)- 

Conversely, consider a point y G n^=o Then for each j G {0,..., s}, 

the cylinder Cj of level n that contains a^{y) intersects supp^ at some point z^^f 
If j < s then the set a{Cj), being a cylinder of level n — 1 that intersects the higher 
level cylinder Cj+i, must contain it. In particular, the cylinder cr{Cj) contains both 
points and a{z^^'>) from supp/r. Using the hypothesis p G Cn and recalling 

(3.5), we conclude that = a(z^^^), for each j G {0,..., s — 1}. Hence the set 


16 


JAIRO BOCHI AND YIWEI ZHANG 


D := Hj^o ^ '’ (G )j which is a cylinder of level n+s containing y, also contains 
and therefore is contained in B^^ n+s- So y € as we wanted to show. □ 

As a consequence of the lemma, if ^ S C„ then H^o“ supp/r. 
Therefore we say that the set is a hasin of repellency for the periodic orbit 
that supports y. 

Let c denote the complement operation, that is, := 17 \ B. 

Lemma 4.4. Given measures fi G Cn and ^ € Ck with n < k and per(/r) < per(^), 
and a function g G Sk, we have 

{ 9,0 - { 9 , 9 ) < 2(fc-n + l)||g||ooC(-B;;,„). 

Note that n < k does not imply that per(/i) < per(^) in the statement above; 
an example appears in Remark 3.3. 


Proof of Lemma 4-4- Write m := per(/r) and £ := per(^). The step function g G Sk 
induces a function on the arcs of G/c, which by simplicity we denote by the same 
symbol. Let A and B be respectively the sets of arcs in the cycles Wkiy) and Wk{C) 
(in the notation from §3.4). So ffA = m and ffB = t. Let i := #(A fl B). Then: 


{ 9 ,0 - ( 9 , 9 ) = ~ SW) 

I m “ 

a^B a^A 

= J Sia)-^ 9{c 


< 


a^B\A 

t — i m — i 


aeA-^B 


+ 


m 


llfflloo. 



XI 9{a) 

a^Ar\B 


By assumption, m < £, and so the quantity between curly brackets is 2{£ — i)/£ = 
2f{B'^j,). Using Lemma 4.3 and invariance of 


as’f^k) = ? 



< ik-n + l)^{Bl ). 


The lemma follows. 


□ 


4.3. Proof of the gap criterion. Fix / G G°(17) satisfying the gap condition 
(4.1) for some n G N. Let y G Cn he the maximizing measure for A„/, and fix 
an arbitrary v G Ada-- We need to prove that (/,/r) > (/, with equality only if 
V = y. 

For each integer k > n, we will deal with the basin of repellency Bk,fi introduced 
in §4.2, which we will denote by Bk for simplicity. 

For each k > n, consider convex combinations of the form 

(4.3) i'k='^Pk{f)f 


(so the numbers Pkif,) are nonnegative and their sum over ^ is 1) such that 
(4.4) TTkil'k) = T^kil')- 
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Such combinations always exist, but uniqueness may fail. We choose and hx a 
combination such that the following quantity: 

(4.5) Mfe := ^ PkiO^iBn) 

CeCfc 

per(J)>per(/i) 

is minimal among all possible decompositions; its existence follows from a compact¬ 
ness argument. 

For example, if u = /3 (the unbiased Bernoulli measure) and per(/i) = 4 then 
Us = 0, because we can take U 3 as a convex combination of of measures in C 3 of 
periods less than 4, namely U 3 = |0-|-|l-|-| 001 -I- | Oil. 

Lemma 4.5. The sequence (uk)k>n is monotone nondecreasing. 

Proof. Fix k > n. For each 77 G Ck+i^ write the circulation TTkiv) s-s a convex 
combination of vertices of Rk' 

(4.6) TTkiv) = ^ TTkiO ■ 

«eCfc 

Note that 

(4.7) per(^) > per( 77 ) ^ g^(^)=0; 

indeed for each £ Ck the closed walk kFfe(^) visits exactly per(^) nodes of Gk, 
while the closed walk Wk{ri) visits at most per(? 7 ) nodes. 

On the other hand, 


7rfc(u) = 7rfe(7rfc+i(u)) = TTfc ^ ^^+1(77)77 = ^ 77^+1(77)71^(77). 


. j?eCfe+i 


y/CCfc+i 


Substituting the expressions (4.6) and manipulating, we obtain: 


7 rfe(u) = 7 rfe(u), where — X! X! Pk+iiv)QviO ] ^^kiO 

CeCfc \r]&Ck+l 

which should be compared with (4.3) and (4.4). Therefore: 


Uk < E I E Pk+iiv)(lviO ) ^(Bn) (by minimality of (4.5)) 

^y/CCfc+i 


ieCk 

per(5)>per(/x) 


< 


E ^ Pk+i{v)qviO 


r)GCk+i 

per(ry)>per(^) 


(by (4.7)) 


E Pk+liv) 

i^eCfc+i 

per(r;)>per(ii) 


E 

ieCk j 


(Bn) 


E Pk+l{v)viBn) 

riGCk+i 

per(ry)>per(/i) 


(using (4.6), XB„ £SnCSk, (3.2)) 


— ^fc+ 15 

thus proving monotonicity. 


□ 
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We are ready to prove the gap criterion: 

Proof of Lemma f.l. Let /, n, /i, ly be as explained above. Write 

OO 

f = Anf where gk ■= ^ Cu,{f)K . 

k—n \uj\—k 

Then, using the projection property (3.2) together with (4.3) and (4.4), we have: 

OO 

if, L-v) = {Anf, g.-v) - ^{gk,v - ii) 

k—n 

OO 

= - Vn) - '^{gk,Vk - g) 

k—n 

OO 

(4-8) = '^Pn{i){Anf,g-f) - ^ '^Pk{f,){gk,^-g) ■ 

feCn fc=n jGCfc 

V-.-^ V-,-^ 

® 

We will estimate the terms (l) and (2)^ from below and from above, respectively. 

On the one hand, by definition of the gap, for all ^ G Cn different from /i we 
have {Anf,g — 0 ^ E^PniAnf), and therefore 

® > (1-p„(®)gap„(^„/) > (l-u„)gap„(A„/). 

On the other hand, if ^ S Cfc has per(^) > per(/r) then by Lemma 4.4, 

{gk,^-g) < 2\\gk\\oo^{Bl) < 2{k - n + l)\\gk\\oo ■ 

In the case that per(^) < per(^) we use the trivial estimate 

{gk,^- g) < 2||5fc||oo < 2(fc - n + l)||gfc||oo ■ 

So we have: 

@fe < 2(/c-n+l)||gfe||oo I ^ Pk{C) + X! PkiO^iBn) > 

eeCfc CGCio 

l^per({)<per(/j) per(C)>per(/i) 

= 2{k-n + l)||gfc||oo (1 - Uk) 

<2{k-n + l)||gfe||oo (1 - Un), 

where in the last step we used Lemma 4.5. Substituting these estimates into (4.8), 
we obtain 

OO 

{f,g->^)>S{l-Un), where 5 := gap„(A„/) - ^ 2(/c - n + l)||5fc||oo • 

k—n 

Note that ||gfc||oo = ^supi^j^j, |c^(/)|, and so assumption (4.1) means that S > 0. 
Moreover, by definition < r'„(i?„), so 

{f,g-v) > 6iyn{B^)- 

In particular (/,/r — v) >0. Assume that equality holds; then Vn{Bn) = 1- Since 
Bn is a basin of repellency for /i, we have Vn = g- So 7r„(i/) = 7r„(/r) and by the 
injectivity property of the rotation set i?„ we conclude that v = g. This proves 
that /i is the unique maximizing measure for /, as claimed. □ 
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5. Proof of prevalence 

In this section we prove Theorem B which, as explained before, yields Theorem A 
as a corollary. 

A box <5 C is a product Ji x • • • x/^ of compact intervals of positive lengths, the 
least of which is called the thickness of the box and is denoted by t{Q). Lebesgue 
measure on is denoted by Lebd- We will estimate the measure of slices of a box, 
as in Fig. 4. 



Figure 4. Slicing a box. 


If L is a linear functional on R'^, say L{x) = then its oo-norm is 

||L||oo := maxi |Ai|. 

Lemma 5.1. Let Q C R^ fee a box. Let tp: R'^ —^ R fee an affine function with linear 
part L ^ 0. Then, for all <5 > 0, 

Lebddcc G Q ; \ip{x)\ < fe}) ^ 25 

LehdiQ) “ ||A||oor(Q) ■ 

Proof. Let Q = Ii x ■ ■ ■ x Id and L{x) = J2i=i Without loss of generality, as¬ 
sume that ||L||oo = |Ai|. For each y G hx - • -xld, the section {xi G Ji ; \ip{xi,y)\ < 
(5} has measure at most 25/\Xi\. So, by Fubini theorem, 

Lehdijx G Q ; |y?(x)| < ^}) ^ 25 hehd-ijh x • ■ ■ x J^) ^ 25 

Lebd(Q) - |Ai| Lebd(Q) -\\L\\^t{Q)- 

Proof of Theorem B. Fix an evanescent sequence a, and an admissible gauge b 
with respect to a. Fix an arbitrary /o G C“(n). Recall that our aim to prove that 
for Rb-almost every function g in the Hilbert brick Tib, the function f = fo + 9 
has the locking property. For that purpose we will use the gap criterion version 2 
(Corollary 4.2). Note that hypothesis (4.2) is automatic from evanescence. 

For each n G l\l, let 

OO 

5„-.= y^ {k-n + I)(afcLip„(/o) -|-fe^) and 

k—n 

@n ■■= {g GPtb ; gap„ {Affifo + g)) > (5„} . 

In view of (2.9) and the definition of the Hilbert brick T(b, the number 5n is an 
upper bound for the RHS of (4.1) for all functions of the form f = fo + g with 
g G Tib- So by Corollary 4.2 we have: 

(5.1) /o + 5 G La . 

n 

So in order to prove Theorem B it is sufficient to show that Rf,(^r!,) —>• 1 as n —>■ oo. 
Fix a positive integer n. Let dn := 2" — 1 and let 7r„: (^“(H) R'^" be the map 
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that associates to a function / its Haar coefficients of level less than n (the initial 
coefficient c(/) being disregarded). Then TVni'Hb) equals the box 

Qn — 7 

luil<n 

moreover the push-forward of Pb under 7r„ is the normalized Lebesgue measure on 
Qn- For each measure /j, on SI, define a linear map —>■ D? by 

^ ^ (^uj ; ■ 

CJ 

Then (Anf^n) = c{f) + 4(7r„(/)) for every / e C“(Sf). 

Let Gn '■= ^n{@n)- Note that @n = because the definition of only 

involves the initial Haar coefficients. Write ^n = \ Qn and = Qn ^ Gn- If 

g S Qn then there exist distinct elements /r, v of Cn such that 

\{An{fo+ v)\ < Sn- 


Moreover, p, v can be chosen as neighbors in the sense that [7r„(/i), 7rji(iz)] is an 
edge of the polytope i?„. It follows that 

GnC. y ja; G ; |(4 --f a;)| < (5„| , 

neighboors 


where := 7r„(/o). In particular, by Lemma 5.1, 


(5.2) 


PbiQ^n) 


Lebd„(G;)) ^ 2Sn \\T - T 

Lebd„(Q„) “ T{Qn) ^ 

neighboors 


Given g- ^ v in Cn, there exists a node of the graph Gn that is visited by the 
cycle Cn{g) but not by cycle Gn{v)- That is, there exists a word w of length n — 1 
such that 

{hoj,g) = - -and (/i^,i/) = 0. 

2per(/r) 

We have per(/r) < 2”“^ (since the cycle Gn{g) contains no repeated nodes), and 
therefore ||/^ — /,y||oo > 2“". On the other hand, the number of pairs of neighbors 
in Cn is equal to the number of edges of the polytope i?„. We have seen 
in §3.5 that has no more than 2^ faces of all dimensions. Substituting these 
estimates back in (5.2) we obtain: 

(5.3) Vb{Ql) < 2"”+”+M„/r(Q„). 


Recall the evanescence and admissibility conditions (1.2), (1.4), which can be rewrit¬ 
ten as: 

an = 0 (2~‘^"'^\n-i^ , bn = o(a„), On = . 

Then: 

OO 

Sn = '^{k - n + l)O(afe) = 0(a„) = O ^2“^ ^ a„_i^ . 

k—n 

bn = o{an) = O (2-^''^\n-i) = O 6„_i) . 


Also, 
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In particular, if n is large enough then niin{6g,6]^,..., &„_i} = that is, the 

box Qn has thickness T{Qn) = k^-i- Substituting these estimates back in (5.3) we 
obtain: 

= 2-2"+0(-) = o(l). 

As remarked before, it follows from (5.1) that fo + 9 & for Pb-a.e. g, as we 
wanted to prove. □ 


6. Directions for future research 

The results of this paper motivate a number of questions. 

Consider the property of uniqueness of the maximizing measure (which is of 
course weaker than the locking property). This property is known to be topolog¬ 
ically generic in many function spaces: see [CLT, Je3]. So it is natural to ask 
whether the uniqueness property is prevalent in function spaces larger than those 
considered in this paper. 

The most obvious open question is whether our prevalence results hold on larger 
function spaces. The reader will have noticed that several estimates in the proof 
of Theorem B are very crude. Nevertheless, it is not clear how the evanescence 
condition (1.2) could be significantly weakened. One of the obstacles is the fact 
that low-period vertices in belong to many edges: for example the two vertices 
of period 1 are connected by edges to all other vertices. Such “edgy” vertices give 
rise to many potential gaps to be controlled. On the other hand, those vertices 
seem to be “pointy” and should typically generate large gaps, though we have not 
been able to formalize this. On the other extreme we have the vertices of maximum 
period, namely the hamiltonian cycles (see Remark 3.4). These vertices seem to 
be “blunt”. On the positive side it can be shown that each of them belongs to 
exactly 2”“^ edges, which is the minimum number allowed by the dimension of the 
polytope. To summarize, an improvement of our methods seems to require a finer 
understanding of the geometry and the combinatorics of the polytopes i?„ (or their 
duals). 

Ultimately one would like to go beyond the locking property and formalize the 
conjecture of Hunt and Ott [HO] (mentioned in Remark 1.2 above) that maximizing 
measures typically have low period. 
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